Course syllabus

Language Technology
Språkteknologi

EDAN20, 7.5 credits, A (Second Cycle)

Valid for: 2024/25
Faculty: Faculty of Engineering LTH
Decided by: PLED C/D
Date of Decision: 2024-04-16
Effective: 2024-05-08

General Information

Depth of study relative to the degree requirements: Second cycle, in-depth level of the course cannot be classified
Elective for: C4-pvs, C4-pvt, D4-pv, D4-mai, E4-bg, F4, F4-pv, F4-mai, MMSR2, Pi4-pv, Pi4-bam
Language of instruction: The course will be given in English

Aim

In the past 15 years, language technology has matured considerably, driven by the massive increase in textual and spoken data and the need to process them automatically. Although few systems are entirely dedicated to language processing, scores of applications are now to some extent "language-enabled" and embed language processing techniques such as spelling and grammar checkers, information retrieval and extraction, or spoken dialogue systems. Knowledge of the field has therefore become a new requirement for CS engineers.

The course introduces theories used in language technology. It aims to cover the whole field, from character encoding and statistical language models, through syntax and parsing, to semantics and conversational agents. It focuses on proven techniques as well as significant industrial or laboratory applications.

Learning outcomes

Knowledge and understanding
For a passing grade the student must

Competences and skills
For a passing grade the student must

Judgement and approach
For a passing grade the student must

Contents

Examination details

Grading scale: TH - (U, 3, 4, 5) - (Fail, Three, Four, Five)
Assessment:

To achieve a grade of 3, approved assignment submissions (laboratory work and individual reports) are required. A written exam is also required for a higher grade. The assignments (laboratory work) are to be carried out in groups of two students, but can also be done individually. The first laboratory session is dedicated to the programming tools used in the course. The remaining six assignments are addressed in subsequent laboratory sessions.

The examiner, in consultation with Disability Support Services, may deviate from the regular form of examination in order to provide a permanently disabled student with a form of examination equivalent to that of a student without a disability.

Modules
Code: 0113. Name: Statistical Techniques for Text Analysis.
Credits: 3.5. Grading scale: UG - (U, G). Assessment: To qualify for a passing grade the laboratory work must be completed. The module includes: Laboratory work.
Code: 0213. Name: Syntactic and Semantic Processing of Text.
Credits: 4.0. Grading scale: UG - (U, G). Assessment: To qualify for a passing grade the laboratory work must be completed. The module includes: Laboratory work.
Code: 0313. Name: Written Examination.
Credits: 0.0. Grading scale: TH - (U, 3, 4, 5). Assessment: Passing the course with a grade of 3 requires passing all the assignments. Optionally, students may take the written examination to improve their grade to 4 or 5. The module includes: Optional written examination.

Admission

Admission requirements:

The number of participants is limited to: No
The course overlaps the following courses: EDA171

Reading list

Contact

Course coordinator: Professor Pierre Nugues, Pierre.Nugues@cs.lth.se
Course homepage: https://cs.lth.se/edan20